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Declaration 



I, Shizuo AKIRA, declare and say as follows: 

1. I am a citizen of Japan, residing at Zushi, 1-7-16, Takatsuki, 
Osaka, Japan. 

2 . I received a Bachelor of Medicine Degree from Osaka University 
in 1977, and a Doctor of Medicine Degree from Osaka University in 1984 . 

3. From 1980 to the present, a period of 25 years, I have been 
directly involved in various aspects of immunological and molecular 
biological research, including mechanism for recognition of pathogens 
in natural immunity. 

4. I was Professor of Department of Biochemistry, Hyogo College of 
Medicine, Nishinomiya, Hyogo, Japan, from 1996 to 1999. 

5. I have been Professor of the Research Institute for Microbial 
Diseases, Osaka University, Suita, Japan, from 1999 to the present. 

6 . I have been Director of the Japanese Society for Immunology from 
2003 to the present. 

7. I have authored and co-authored about four hundred scholarly 
papers in the field of immunology and molecular biology, as reflected 
in ray curriculum vitae, an accurate copy of which is annexed hereto 
as Exhibit "A" . 



8. I have authored, co- authored and edited about twenty textbooks 
in the field of immunology and molecular biology. 

9. I have no affiliation with KK Hayashibara Seibutsu Kagaku 
Kenkyu jo, located in Okayaraa, Japan. 

10. I am familiar with gene recombinant technology and protein 
engineering, as well as with the scientific literatures on 
interleukins and the content of European Patent No. 0712931. 

11. In March 2005, I was asked by Mr. Toshio MIYAKE , Executive 
Director, Intellectual Properties Center, KK Hayashibara Seibutsu 
Kagaku Kenkyu jo, to give a declaration on the polypeptide (human 



interleukin 18) in the above-identified patent. 

12. As seen in my curriculum vitae, I believe that I am sufficiently 
specialized to make such a declaration. 

13. I have read and am thoroughly familiar with the version of the 
specification relating to the* above -identified patent as originally 
filed. I make this Declaration based on my view of what would have 
been directly and unambiguously derivable from the specification at 
the earliest priority date, when account is taken of matter that would 
have been implicit to me at that time as a skilled addressee. 

14. I have read claim 1 as granted for the above -identified patent 
and, in my view, it does not add subject matter which extends beyond 
the content of the application as originally filed, when account is 
taken of matter which is implicit to the person skilled in the art . 

15. I have read claim 4 as granted for the above- identified patent 
and, in my view, it does not add subject matter which extends beyond 
the content of the application as originally filed, when account is 
taken of matter which is implicit to the person skilled in the art . 

Claim 1 

16. In my view a polypeptide of human origin which induces IFN- 7 
production by immunocompetent cells and comprising a part of the amino 
acid sequence as depicted in SEQ ID NO:l, said part including at least 
the first ten amino acids as depicted in SEQ ID N0:1 would have been 
directly and unambiguously derivable by the skilled addressee from the 
specification. 

17. Page 13 lines 4 to 6 of the specification teach that a polypeptide 
according to the invention that is the subject of the above-identified 
patent may be a polypeptide comprising an amino acid sequence as 
depicted in SEQ ID NO:l. 

18. Page 13 lines 14 to 28, page 21 lines 11 to 22, and page 27 lines 
8 to 18 would have made it clear to the skilled addressee that a 
polypeptide according to the above-identified patent includes 
variants and homologous amino acid sequences to the amino acid sequence 
shown in SEQ ID NO:l. More specifically, these passages envisage: 

-replacing one or more amino acids in SEQ ID NO: 1 with other amino acids 
without altering the inherent biological activity (page 13 lines 16 
to 18; page 21 lines 18 to 19; and page 27 lines 14 to 15); 

-removing one or more amino acids near to the N and/or C termini in 
SEQ ID NO:l or adding one or more amino acids near to the N terminus 
in SEQ ID NO: 1, again while keeping the inherent biological properties 
of the polypeptide (page 13 lines 21 to 27); 



-adding one or more amino acids to the N and/or C termini in the amino 
acid sequence of SEQ ID N0:1, again while substantially not losing the 
IFN- r production inducing activity (page 21 lines 17 to 18) ; 

-losing one or more amino acids in the N and/or C termini of the amino 
acid sequence in SEQ ID NO:l, while substantially not losing the IFN- 
r production inducing activity (page 21 lines 19 to 22); 

-adding one or more amino acids to the N and/or G termini, so long as 
the properties are retained (page 27 lines 17 to 22). 

19. The skilled addressee would have understood from the 
specification that polypeptides according to the invention defined in 
the above -identified patent include those shown in SEQ ID N0:1 even 
with substantial modifications to the primary amino acid sequence, 
provided that the inherent biological properties of the polypeptide 
are retained. By removing, adding or replacing (1, 2, 3, 4 etc) amino 
acids in SEQ ID NO:l there is disclosed a range of polypeptides, each 
having an amino acid sequence containing a part of SEQ ID NO:l. The 
skilled addressee would have understood that the limitation on this 
range is that all of the polypeptides in the range must retain the 
inherent biological properties . 

20. In view of the passage at page 13 line 27 to page 14 line 1, the 
skilled addressee would have understood that the inherent biological 
properties of the polypeptide which must be retained is the induction 
of IFN- T production by immunocompetent cells . 

21. The skilled addressee would have noted that all individual 
polypeptides disclosed in the above -identified patent, for example the 
polypeptides having an amino acid sequence as depicted in SEQ ID NO:l, 
SEQ ID NO: 7, SEQ ID NO : 8 or SEQ ID NO : 9 have in common the amino acid 
sequence as shown in SEQ ID NO: 7 (the first ten amino acids of SEQ ID 
NO : 1 ) . 

22 . The skilled addressee would have noted that a further polypeptide 
exhibiting these biological properties is disclosed in Example A- 6. 
The amino acid sequence containing the N terminus of this polypeptide 
is analysed in Example A-7-3. This analysis reveals an amino acid 
sequence where two amino acids have been added to the N terminus of 
SEQ ID NO: 7. The skilled addressee would have noted that this is in 
accordance with the teaching at page 13 lines 23 to 24 and page 21 lines 
17 to 18 and page 27 line 16. 

23. The skilled addressee would have noted that a still further 
polypeptide exhibiting these biological properties is disclosed in 
Example B-1-2 where one amino acid has been added to the polypeptide 
of SEQ ID NO: 7 . 



24. The first ten amino acids in SEQ ID NO:l are therefore disclosed 
as a lower limit for the range of polypeptides , each having an amino 
acid sequence containing a part of SEQ ID NO:l. In the light of common 
general knowledge, in combination with the remainder of the 
specification and particularly the passages referred in paragraph 18 
of this Declaration, a polypeptide comprising a part of the amino acid 
sequence as depicted in SEQ ID N0:1, said part including at least the 
first ten amino acids as depicted in SEQ ID NO:l would have been 
directly and unambiguously derivable by the skilled addressee from the 
specification . 

25. From the disclosures in the above -identified patent that I have 
referred to above, it would have been implicit to the skilled addressee 
that the retention of the inherent biological properties of the 
polypeptide having SEQ ID NO: 1 is dependent upon the polypeptide having 
an amino acid sequence comprising the amino acid sequence shown in SEQ 
ID NO: 7. It would have been a routine exercise to verify this 
conclusion by generating variants of SEQ ID N0:1 and screening for the 
desired inherent biological properties , namely induction of IFN- T 
production by immunocompetent cells, by means of usual recombinant DNA 
and protein engineering technologies (as illustrated in James D. 
Watson et al . , "Recombinant DNA", Second Edition, Chapters 11 and 23, 
published by W. H. Freeman and Company). 

26. It would have been clear to the skilled addressee, as a 
consequence of the disclosures in the specification for the above- 
identified patent, that all polypeptides of human origin having an 
amino acid sequence which comprises the amino acids sequence as shown 
in SEQ ID NO: 7 and capable of inducing IFN- T production by 
immunocompetent cells are the subject of the above-identified patent. 

Claim 4 

27. The specification for the above-identified patent unambiguously 
discloses a DNA encoding the subject polypeptide which has a base 
sequence containing a part of the base sequence depicted in SEQ ID NO: 2 . 

28. Page 14 lines 8 to 11 explicitly recite that one or more bases 
in the base sequence SEQ ID NO: 2 can be replaced with other bases by 
means the degeneracy of genetic code without altering the amino acid 
sequence of the polypeptide. Further, original claim 4 in the 
above -identified patent refers to homologous base sequences. 

29. The skilled addressee would have noted that original claim 4 is 
dependent on claim 3 to define a DNA which encodes a polypeptide of 
claim 1. The polypeptide of original claim 1 comprised an amino acid 
sequence as depicted in SEQ ID NO:l or its variants. The skilled 
addressee would have understood from the passages I already have 
referred to in paragraph 18 of this Declaration that variants are 
obtained by removing, replacing or adding amino acids. SEQ ID NO: 2 
encodes the polypeptide having amino acid sequence as depicted in SEQ 



ID NO:l. As such, it would have been clear to the skilled addressee 
that base sequences which encode "variants" of SEQ ID N0:1 would by 
necessity include variant base sequences containing a part of the base 
sequence as shown in SEQ ID NO: 2 • 

I declare that all statements herein of my own knowledge are true and 
that all statements made on information and belief are believed to be 
true . 
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Preface 



Application of recombinant DNA techniques to biology 
is bringing about a revolution in our understanding of 
living organisms. There is no field of experimental bi- 
ology that is untouched by the power we now have to 
isolate, analyze, and manipulate genes. When the first 
edition of Recombinant DNA was published in 1983, re- 
conabinant DNA techniques were already being used 
extensively for the analysis of viral and bacterial genetics, 
but dissection of eukaryotic genes was only just begin- 
ning. There were hints of what was to come. The concept 
of the gene as a continuous stretch of DNA had been 
shattered with the discovery of introns, but alternative 
splicing and geries-within-geries were yet to be revealed- 
Identification of cellular oncogenes seemed to promise 
an understanding of cancer, but the mechanisms of their 
action — and the existence of tumor suppressor genes — - 
were still subjects for speculation. A handful of genetic 
diseases were being analyzed at the molecular level, but 
the isolation of the disease genes and the development 
of gene therapy were yet to come. 

Our. aim in writing the second edition of Recombinant 
DNA is to show how recombinant DNA techniques have 
led to the explosion in pur knowledge of fundamental 
biological processes. As in the first edition, which was 
subtitled A Short Course, we provide a concise presentation 
of the methods, underlying concepts, arid far-reaching 
applications of recombinant DNA technology. The field 
has grown since the publication of the first edition, and 
so has our book. But even though our previous subtitle 
may be inappropriate for this enlarged edition, our ap- 
proach to the material has remaiDied true to the spirit of 
the "short course": as before,, the uninitiated will find 
access to the field of recombinant DNA here. 

The book is now divided into six major sections. The 
first five chapters, which are largely unchanged from 
the first edition, provide a historical introduction to the 



early development of recombinant DNA technology, up 
to the point when studies of eukaryotic organisms began 
in earnest. In the next section we describe in detail the 
methods currently used to clone and analyze genes, and 
devote an entire chapter to the polymerase chain re- 
action, wKich has. had an extraordinary impact on re- 
search. The great power of recombinant DNA techniques 
comes from the ability to explore gene functions by 
manipulating genes and then introducing them back into 
cells. The third section of the book discusses how this 
is done in mammalian cells, yeast, mice, and plants. The 
fourth section describes the progress these manipulations 
have allowed in key areas of biology. Here the range of 
recombinant DNA applications is demonstrated, from 
the analysis of cell cycle control and embryonic devel- 
opment, to die isolation of genes involved with brain 
function. Indeed, these techniques have spawned a whole 
industry — ^biotechnology. In the fifth section, we describe 
some of its accomplishments, including the development 
of genetically engineered pharmaceutical and agricul- 
tural products, and the studies of the human immuno- 
deficiency virus that are leading the attack on AIDS. The 
differences between the first and second editions are 
perhaps most evident in the final section, where we de- 
scribe the revolution in human molecular generics and 
the ways in which recombinant DNA techniques are 
providing new methods for diagnosis and treatment of 
human inherited diseases. 

The topics that are covered and the approach we take 
to describing them make this book suitable for under- 
graduate and graduate students in molecular biology, 
cell biology, biochemistry, genetics, or biotechnology 
courses; for medical students and physicians; and for 
others who have an interest in recombinant DNA tech- 
niques — for example, forensic scientists, patent attor- 
neys, and science journalists. 
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Textbooks dealing with biochemistry, molecular ge- 
nerics, and molecular biology usually present information 
withp-ut describing the experiments that were done to 
obrafh it. We think that this is a pity, because designing 
and doing experiments is exciting and fun. As in the first 
edition, we have used real experiments to illustrate im- 
portant biological phenomena, and we have plundered 
our colleagues' papers for interesting examples. Figures 
are used profusely to try to make complex real-life ex- 
periments intelligible, but inevitably we have not been 
able to present all the subtle details. Those who want to 
explore these details will find the experiments in the 
research papers listed at the end of each chapter, and 
the review papers we cite will provide an entry point to 
each topic. 
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In Vitro 
Mutagenesis 



1^- ecombinant DNA technology and DNA sequencing provided the tools 
i.\x-to clone and characterize genes. As we learned in Chapter 8, simple 
inspection of gene sequences told us- much about genomic organization. Func- 
tional sequences, such as transcriptional control elements, could often be iden- 
tified by comparing sequences of a number of genes. However, to delve deeply 
into the stniGture and function of genes required the ability to change the DNA 
sequence and examine the effect of the change on gene fdncrion. For decades 
before the advent of recombinant DNA, this was done by classical genetics, the 
identification of mutant organisms with new properties. From the genetic prop- 
erties of mutants, information about the structure and function of the underlying 
genes could often be inferred. This approach, however, was limited to organisms 
in which simple generic analysis was possible — bacteria, yeast, fruit flies. Generic 
analysis of more complex, longer-lived organisms like mice and men was slow 
and difficult. 

Recombinant DNA changed all that. The ability to isolate genes as molecular 
clones, the development of tools to modify gene sequences in the test tube, and 
the power to return altered genes to the organism to test their fiinction have 
revolurionized the way genetics is done in higher organisms. Because we now 
often work "backwards" from gene sequence to gene function, in contrast to 
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, FIGURE 11-1 
General strategy for an in vitro mutagenesis experiment 
Most procedures for in vitro mutagenesis follow the same 
basic scheme: Plasmid DNA is "mutagenized" in vitro, then 
mtroduced into E. colt by transformation. Depending on the 
method, mutant clones can be isolated and tested individ- 
ually, or a library of mutant plasmids can .be obtained, 

-which are tested using a generic screen. 



classical genetics, this new approach spawned by re- 
connbinant DNA is called reverse genetics. In this chapter 
we will learn ways to alter the sequence of a cloned 
gene at will and how these methods are used to un- 
derstand the structure and function of genes and gene 
products. 

In Vitro Mutagenesis Is Used 
to Study Gene Function 

In vitro mutagenesis of cloned genes has become a 
standard tool in the functional analysis of nucleic acids 
and proteins. Most procedures follow the same basic 
scheme (Figure 11-1). Plasmid DNA containing the 
gene of interest is treated in vitro by some mutagenesis 
procedure that alters the DNA either chemically or 
enzymatically. The mutagenized plasmid DNA is in- 
troduced into £ colt by transformation, and colonies 
containing plasmid molecules are selected by anti- 
biotic resistance. Mutants can be made one at a time, 
or hundreds of different mutants can be created in a 
single mutagenesis experiment. Mutant plasmids can 
be isolated from single colonies and tested individ- 
ually. Alternatively, plasmid DNA can be prepared 
from pooled colonies and the resulting library tested 
en masse to identify mutant plasmids. 

The various approaches to mutagenesis can be 
grouped broadly into random and site- directed meth- 
ods. Random methods put mutations anywhere in a 
plasmid. They are best used to identify the location 
and boundaries of a particular function within a cloned 
DNA fragment and are most readily used for this 
purpose when a simple genetic screen (or selection) 
is available, A genetic screen or selection consists of 
a system to test the function of the DNA of interest 
in cells without having to isolate each plasmid indiv- 
idually. Random mutagenesis is often used as a first 
step, when litde is known about the function encoded 
by particular DNA fragment Analysis of random mu- 
tants generally provides only a simple identification 
of the functional region but does not explain how 
things work on a molecular level. The value of such 
a strategy is that it quickly helps to narrow down the 
focus of attention from a large DNA fragment to a 
smaller region that can be studied subsequendy in 
greater detail. As we will learn, random mutagenesis 
can be accomplished by several different methods, 
such as altering the sequences within restriction en- 
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donuclease sites, inserdng an oligonucleotide linker 
randomly into a plasmid, damaging plasmid DNA in 
vitro with chemicals, or incorporating incorrect nu- 
cleotides during in vitro DNA synthesis. 

Once an important functional domain in a gene has 
been identified by random mutagenesis, site-directed 
methods — putting mutations precisely where they are 
needed — are used to define the role of specific se- 
quences. In addition, directed mutagenesis provides a 
powerful tool for the analysis of protein function, by 
allowing researchers to make specific and subtle 
changes in the strucmre of the protein. A number of 
strategies have been developed to consmict site-di- 
rected mutants in vitro, but tlus . type ot m 
is best accomp hshed using synthetic oligonucleotides. 
With an oligonucleotide the d e sired sequ ence is sim- 
ply built into die wild-type framework. Nowadays, 
oligonucleotide-directed mutagenesis reacdons are 
relatively straightforward, and oligonucleoddes are 
cheap and easy to obtain. The limitation of site-di- 
rected mutagenesis is that you must already have 
enough information to know what you wish to change. 
There are two standard ways of using oligonucleo- 
tides to construct site-directed mutants: mutagenesis 
by gene synthesis and mutagenesis by enzymatic ex- 
tension of a mutagenic oligonucleotide. By using de- 
generate oligonucleotides (see Chapter 7) a set of 
"random" mutations at a specific site can also be made. 

Restriction Endonnclease Sites Provide 
the Simplest Access for Mutagenesis 

One of the first experiments done with a cloned DNA 
fragment is to map the positions of restriction endo- 
nuclease cleavage sites in the DNA by using a battery 
of different enzymes. Although this information could 
be precisely obtained from the DNA sequence, map- 
ping restriction sites can be accomplished rapidly and 
is often done in conjunction with sequencing. Re- 
striction endonuclease recognition sites provide the 
simplest way to modify a DNA clone in vitro (Figure 
11-2). Cleaving plasmid DNA with a restriction en- 
zyme that recognizes only one site produces a linear 
molecule. This sbrves as an entry point for modifying 
the DNA sequence in the vicinity of the restriction 
site. For example, the enzyme EcdBS recognizes the 
sequence GAATTC and produces ends with 5' ov- 
erhangs. The ends can be made even (blunt) by treating 




/ ^ ^ 

4 bp deleted 4 bp inserted 

FIGURE 11-^2 

Creating a mutation by mampularion of a restriction site, 
Plasmid DNA is eleaved with Eco^ restriction endonu- 
clease, which generates a linear fragment with 5' ends that 
have four unpaired nucleotides (so-called sticky ends).- 
Treatment with Si nuclease (left) removes these nucleo- 
tides, and the linear fragment is then treated with DNA li- 
gase. The resulting circular molecule contains a deletion of 
4 bp. Alternatively, addition of DNA polymerase and deoxy- 
ribonucleotide triphosphates (dNTPs) to the plasmid cleaved 
by EcoSii extends die 3' ends by DNA synthesis (right). 
After ligation, the resulting molecule contains an insertion 
of 4 bp. In both cases, the Eco^ site has been destroyed. 
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FIGURE 11-3 

Linker insertion mutagenesis to map functional domains of a 
bacterial transposable element. The starting plasmid contains 
an intact transposon, an ampicillin-resistance gene for selec- 
tion Sn £ coli, and sequences for plasmid replication. The 
DNA is ueated with a low concentration of deoxyribonu- 
clease I in the presence of Mn-'"". Under these conditions, 
the enzyme makes double-stranded cuts at random positions 
in the plasmid, generating a collection of linear DNA mole- 
cules broken at different positions. Oligonucleotide linkers 
encoding an EcoRl restriction site are added to the ends 
with DNA ligase, the linear molecules are' treated with 
Ecom endonuciease to create sticky ends on the linkers, and 
the molecules are recircularized. The circular molecules are 
transformed into £ coli, and ampicillin-resistant colonies are 
selected. Plasmid DNA is isolated from individual colonies, 
introduced into another strain of E, coli, and tested for activ- 
ity of the transposon. The positions of the inserted linkers 
are mapped by restriction digestion. Linkers inserted in one 
region (blue) of the plasmid inactivated the transposon. No 
hnker insertions in the ampicillin-resistance gene were re- 
covered, because these plasmids would fail to yield a drug- 
resistant colony in the original selection of transformed 
K coll 



the cleaved DNA with DNA polymerase in the pres- 
ence deoxyribonucleotide triphosphates. The two 
blunt ends can then be hnked together again (ligated) 
by incubating the linear plasmid molecule with DNA 
ligase. A few nanograms of DNA from the in vitro 
ligation reaction is used to transform E, coli, and the 
new modified plasmid is isolated from one of the re- 
siilting colonies. The net result of these manipulations 
is to insert 4 bp into the plasmid at the EcoRl site. 
Alternatively, a small deletion mutation can be made 
by treating the linearized DNA with Si nuclease, 
which specifically digests single-stranded DNA. This 
creates blunt ends by removal of the four nucleotides 
that constitute the 5' overhang generated by EcoRl at 
each end. Subsequent ligation of the DNA into a co- 
valently closed circular molecule thus results in the 
deletion of 4 bp from the DNA. In each example, the 
new sequence no longer encodes the Ecom recognition 
site. These types of manipulations, if done to a protein- 
coding sequence, would change the translational read- 
ing frame, resulting in production of a grossly altered 
protein. The major limitation of using restriction sites 
to make mutations is that there simply may not be 
sites in regions of the gene the experimenter wishes 
to alter. 
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Linker Insertion Is Used to Map a 
Bacterial Transposon 

We have learned that it is a sinaple naatter to cleave 
a plasmid with a restriction enzyme, blunt the ends 
by treatment with a DNA polymerase, and rejoin them 
by ligation. A variation on this technique is to rejoin 
the ends in the presence of a synthetic oligonucleotide 
"linker," often one that encodes a restriction site. In- 
sertion of the. linker disrupts the gene sequence; the 
position of the inserted linker can be easily mapped 
by cleavage of the plasmid with the restriction enzyme 
that cuts the linker. 

A similar method was used to define the functional 
regions of a bacterial transposable element (a "jumping 
gene," see Chapter 10), by inserting linkers at many 
alternative positions throughout the element. To place 
linker insertions in the transposon, a plasmid carrying 
a clone of the transposon was treated with a nuclease 
that cleaved the plasmids at random positions (Figure 
1 1-3). Cleavage conditions were adjusted so that each 
plasmid was cut just once on average. The linearized 
molecules were isolated and ligated into circles again 
in the presence of an 8-bp linker oligonucleotide con- 
taining an Eco^l restriction site, resulting in insertion 
of the linkers into random sites, one in each plasmid. 
The resulting plasmids were transformed into £. coli 
and, using a genetic screen, examined to see if the 
transposon could jump. Insertion of a linker into a 
region of the transposon critical for its fiinction in- 
activates it, presumably by putting a protein-coding 
sequence out of frame. By mapping the positions of 
the inserted linkers by restriction analysis, the loca- 
tions of functional regions of the transposon were 
deduced- 



Construction of Nested Deletions Maps 
the Boundaries of a Transcriptional 
Control Region 

Transcription of the gene encoding the 5S ribosomal 
RNA molecule is carried out by RNA polymerase III 
(pol III, see Chapter 8). To identify the sequences 
within the 5S gene required for transcription by pol 
III, a series of deletion mutations was made and tested 



for their ability to support accurate transcription. Two 
sets of deletions were made. One was made by cutting 
a plasmid carrying a cloned 5S gene at a restriction 
site on the 5' side of the gene. The linearized plasmid 
was treated with a combination of nucleases that di- 
gested away DNA from the ends of the molecule 
(Figure 11-4). The amount of DNA removed was 
controlled by varying the time, temperature, or en- 
zyme concentration in the reaction, A second set of 
deletions was generated from plasmid DNA cleaved 
at a site on the 3' side of the gene. The result was 
two sets of plasmids with progressively larger deletions 
toward the gene from both direcdons. Testing these 
genes revealed that only deletions entering a 35-bp. 
region within the transcribed region of the 5S gene 
abolished transcriprion by pol III Therefore, this dele- 
tion analysis mapped the transcriprional regulatory 
element to this 35-bp stretch, which has subsequently 
been analyzed in much greater detail by site-directed 
mutagenesis. 

Several different types of enzymes can be used to 
produce deleuons. Generally, these enzymes delete 
DNA from both ends of a linearized plasmid molecule. 
Often, however, one end of the molecule contains 
sequences that need to be retained in the plasmid 
because, for example, they are required for plasmid 
replication. In the 5S gene deletion experiment, this 
limitation was accommodated by isolating the deleted 
gene fragments and recloning them into a new vector. 
Alternatively, a strategy can be used that limits dele- 
tion to one end of a linearized plasmid molecule (Fig- 
ure 11-5). This method is widely used to generate 
nested deletions for DNA sequencing (see Chapter 7). 



Linker-Scanning Mutagenesis Permits 
Systematic Analysis of Promoters 

Deletion mutagenesis of the 5S gene mapped the 
boundaries of the transcriptional control region in the 
gene. But not all the nucleotides within the boundaries 
of that 35-bp region are necessarily critical for func- 
tion. Therefore, methods were needed to change in- 
dividual nucleotides in a target without generating 
gross deletions or other rearrangements. This was ac- 
complished for a viral promoter using an elegant ad- 
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FIGURE 11-4 

Consmacdon of a nested ser of deletion mutants to map the 
transcnpnon control region of a 5S ribosomal RNA gene, 
(a) A plasmid clone was linearized with a restriction enzyme 
at a position (A) on the 5' side of .the gene. The linear frag- 
ments were treated with an exonuclease, which digests 
DNA from both ends of the molecule. Portions of the reac- 
tion were removed at different times to recover populations 
•of molecules with progressively larger deletions. Linkers 
were added to the ends, and the molecules were cleaved 
With restriction enzymes specific for'sites B and C to sepa- 
rate the 5S gene fragments from the remnants of the vector. 
The fragnnients were recloned into a new vector, generating 
the set of rightward deletion mutants. To create the left- 
ward deletion mutants, this process was repeated after 
^ cleaving the plasmid at restriction site B. (b) Individual plas- 
- mids were isolated after transformation, tlieir deletion end- . 
points determined by DNA sequencing, and their ability to 
support transcription by RNA polymerase III tested with an 
in vitro assay. As can be seen by comparing transcription 
activity with the e-xtent of deletion, transcription is inhibited 
when the rightward (5') deletions enter the 4-40 region and 
when the leftward (3') deletions pass the + 80 point This 
suggests that the transcription control region lies between 
4-40 and 4-80. 
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aptation of deletion mutagenesis called linker scanning 
Using the methods outlined in Figure 11-4, two sets 
of plasmids were conspructed that contained deletions 
within the promoter. One set of deletions started from 
a site beyond the 5' end and proceeded toward the 
gene, leaving the 3' end intact; the other set started 
at a point within the gene and proceeded in the op- 
posite direction, leaving the 5' end intact. Each dele- 
tion terminated with a 10-bp BatnHl linker. The extent 
of the deletion in the DNA was determined for each 
plasmid by DNA sequencing. Pairs of plasmids from 
the two deletion sets with endpoints 10 bp apart were 
recombined at their BamHl sites (Figure 11-6). The 
effect was to preserve the length and organization of 
the promoter — thought to be important for promoter 
function — but to replace various 10-bp segments of 
wild-type promoter sequence with the sequence in the 
linker. Thus, this experiment created a Hbrary of pro- 
moter mutants of similar structure but with nucleotide 
substitutions clustered within 10-bp windows located 
at various sites iii the promoter. This collection of 
mutants spanned the length of the promoter. The re- 
sults of this analysis were discussed in Chapter 9. At 
the rime, this experiment represented the most thor- 
ough analysis of a promoter in a mammalian gene. 
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FIGURE 11-5 

Construction of unidirecrional deletions using exonuclease 
III. Exonuclease III attacks preferentially the 3' end of a lin- 
ear DNA molecule with 5' protruding nucleotides. There- 
fore, by cleaving a plasmid molecule at adjacent sites with 
BamHl, which leaves a 5' overhang, and PsA, which leaves a 
3' overhang, only the end generated by BamHl is attacked 
by exonuclease IIL After exonuclease III treatment, the re- 
maining single-stranded tail (along with the overhang at the 
other end) is removed with Si nuclease, which digests only 
single-stranded DNA. An oligonucleotide linker is attached, 
and the fragments are ligated to form closed circular mole- 
cules. In the experiment shown here, deletions are being 
used to map the functional domains of a cloned gene in- 
serted in an expression vector. This strategy allows dele- 
tions to be made only in the cloned gene, without damaging 
the promoter sequence. 



Random Nucleotide Substitutions Are 
Obtained by Chemical Modification of 
DNA or by Enzymatic Misincorporation 

While linker scanning allows the creation of nucleo- 
tide substitutions, each mutant generally contains sev7 
eral substitutions, and the positions of the mutations 
depend on the availability of appropriately placed 
deletions. Therefore, several strategies have been de- 
veloped for placing single nucleotide substitutions at 
random positions in a DNA molecule. The simplest 
methods employ chemicals that modify or damage 
DNA. Generally, plasmid DNA or DNA fragments 
are treated with chemicals, transformed into K colt, 
and propagated as a library of mutant plasmids. Chem- 
icals most commonly used for in vitro mutagenesis 
include sodium bisulfite, which deaminates cytosine 
residues to uracil, and reagents that damage or remove 
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FIGURE 11-6 

^ir^^rTr'^"^ mutagenesis of the viral promoter for the thymidine kinase (TK) gene Two 
he oroinr K "k""'' ."""^^ ''"^'""^"^ ^""^ ^^'^ °" 5' and 3^ sides of 

were seonen^.A T''"' "^'^f Approximately one hundred plasmids 
rdeletion of one f ''"""^ ""'^P"'"'^" «f deletion fragments, where the 

deleJon of 1 n^h r^""' P'^."'^'': '° ''^ downstream from- the endpoint of the 3' 

deleaon of the other fragment, were identified. The HMIll-Bamm fragment of the 3' dele- 
ZmTl ^^-m-^^I fragment of the 5' deledon mutant wfre joined via thef 

ILnfn^ ? K P °^ *^ ^= delenon endpoints. In the 

example shown, th.s results m a cluster of eight nucleotide substitutions (arrows) 
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FIGURE 11-7 

Chemical mutagenesis using sodium bisulfite. Sodium bisul- 
fite reacts with cytosine bases of single- stranded DNA to 
convert them to uracil, a thymine analog that base- pairs 
with adenine. Single-stranded DNA is created with sodium 
bisulfite to modify a small number of cytosine residues in 
each molecule. An oligonucleotide primer is annealed to the 
DNA and serves as a primer for synthesis by DNA poly- 
merase. When the polymerase encounters a uracil in the 
template strand, it incorporates an adenine into the newly 
synthesized DNA. Since the vector sequences are also dam- 
aged by bisulfite treatment, it is necessary to excise the dou- 
ble-stranded DNA fragment by restriction endonuclease 
cleavage and redone it into an undamaged vector. Follow- 
ing transformation into £ coli, a library of mutant plasmids 
can be isolated or individual plasmids can be purified and 
tested. The average number of subsiitutaons in the DNA 
fragment can be controlled by altering the conditions of bi- 
sulfite creatmenc 



bases, thereby preventing normal Watson-Crick base- 
pairing (these include hydrazine and formic acid, 
which are used in Gilbert-Maxam DNA sequencing. 
Chapter 5). Most often, chemical mutagenesis is per- 
formed on single-stranded DNA and followed by in 
vitro synthesis of the complementary strand using a 
DNA polymerase (Figure 11-7). This synthesis in- 
corporates the mutation into the new strand. In DNA 
treated with bisulfite, an adenine nucleotide is incor- 
porated opposite the uracil; after transformation into 
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E. coli, the wild-type C-G base pair becomes a T--A 
pair. In DNA treated with reagents that eliminate 
bases, any nucleotide can be incorporated opposite the 
"abasic" site, which still retains its deoxyribose back- 
bone although it has lost its base. The major limitation 
of chemical mutagenesis is the specificity of the in- 
dividual reagents: bisulfite mutagenesis, for example, 
changes only cytosines. 

All possible nucleotide substitutions can be gen- 
erated using enzymatic misincorporation. Here the 



FIGURE 11-8 

Oligonucleonde-directed mutagenesis by enzymstic primer 
extension. A "mutagenic" oligonucleotide encoding the de- 
sired mutation embedded in wild-type flanking sequence is 
annealed to a single-stranded DNA template. The sequence 
of the oligonucleotide is complementary to the template ex- 
cept for the nucleotides that define the mutation. Generally, 
the mutagenic oligomer is designed so that the mismatched' 
nucleotides are positioned in the middle and there are at 
least 8 to 12 nucleotides on either side chat base^pair with 
the template DNA The mutagenic oligonucleotide serves as 
a primer for DNA synthesis by DNA polymerase. Once the 
enure template has been copied, the ends of the newly syn- 
thesized strand are covalendy linked by DNA ligase. The 
heteroduplex DNA is transformed into £ coli Theoretically, 
bodi strands can replicate, segregating into separate mutant ' 
and wdd-type plasmids. In practice, however, most colonies 
contain only one or the other, because enzymes in the cell 
recognize and repair mismatched nucleotides in the hetero- 
duplex beforfe rephcarion. Plasmid DNA is isolated from the 
resultmg colonies and is screened to identify mutants. 



Strategy is to perform in vitro DNA synthesis under 
nonideal conditions — suboptimal ionic conditions, 

unbalanced concentrations of nucleotide precursors 

that encourage DNA polymerase occasionally to in- 
corporate the wrong nucleotide during synthesis. For 
example, synthesis is carried out in the presence of 
high concentrations of three of the precursors and a 
very low concentration of the fourth. At positions that 
normally call for the fourth (scarce) nucleotide, one 
of the others is sometimes incorporated instead These 
methods also exploit DNA polymerases that lack a 
proofreading activity — a 3' to 5' exonuclease mech- 
anism that checks each base pair after incorporation 
and removes nucleotides that are mismatched. Thermus 
aquaticus (Taq) DNA polymerase, used in the poly- 
merase chain reaction (Chapter 6), lacks such an ac- 
tivity. Though this is a problem when accuracy of 
synthesis is required, the PCR is a very simple and 
efficient way to introduce random nucleotide substi- 
tutions into a DNA fragment 

A general problem with random mutagenesis ap- 
proaches is that they often produce mutants with more 
than one substiration. Multiple substitutions in a single 
mutant complicate the interpretation of an experi- 
m.ent, because it isn't clear which substitution (or 
which combination of substitutions) is responsible for 
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obseryed changes in the properties of the mutant. Ex- 
traordinary methods have been used to circumvent 
this problem — essentially, significantly reducing the 
extent of mutagenesis and using enrichment protocols 
to find rare mutants — but almost all these procedures 
have been supplanted by new methods that use syn- 
thetic oligonucleotides. 



Synthetic Oligonucleotides 
Facilitate Mutagenesis 

Most of the methods for mutagenesis we have dis- 
cussed so far have some significant shortcoming — they 
rely on fortuitous access, to a sequence via a restriction 
site, forced entry through deletion strategies, or te- 
dious screens to find randomly generated mutations 
in the region of interest. ^To be most powerful, mu- 
tagenesis must allow the experimenter to place any 
modiftcauon at any position desired m cloned DJSiA. 
This bas be^^^^ possible, but simple and 

cheap, with the adve^^^ jJNA oligonu- 

cleotides. Oligonucleo^^^^ means to de- 

sign a particular m utation and then to place it precisely 
where you want it. 

The simplest method for doiiig oligo nucleotide - 
directed mutagenesis is by enzymaac primer extension 
(Figure . 11-8). In this rnethod, an oligonucleotide is 
designed that carries the mutation flanked by 10 to 
15 nucleotides of wild- type sequence. This "muta- 
genic" oligonucleotide is hybridized to its comple- 
mentary sequence in single-stranded wild-type DNA 
prepared from a phage or phagemid clone, forming a 
heteroduplex with mismatched nucleotides at the site 
of the mutation. Although the oligonucleotide is not 
perfecdy complementary, it will anneal if the hybrid- 
ization conditions are riot very stringent The oligo- 
nucleotide serves as a primer for in vitro enzymatic 
DNA synthesis by a DNA polymerase that converts 
the single-stranded DNA into double-stranded form, 
using the wild-type strand as template. In this way, 
all regions of the plasmid. except the region containing 
the mutagenic oligonucleotide will be wild-type in 
sequence. Once the primer has been extended com- 
pletely around the tem'plate, the ends of the newly 
S3mthesized strand are ligated, forming a double- 



stranded circular DNA molecule.' This heteroduplex' 
DNA — one strand has the wild-type sequence and the 
other strand has the mutant sequence — is transformed 
into K coll, where either strand can be rephcated. By 
the time a colony grows up, however, it usually con- 
tains only one type of plasmid, wild-type or mutanc 
The types of mutations that can be made by this a p- 
proach range from single nucleotide substitutions to 
deletions or insertion^ hmited only by the si^e of the 
oligonucleotide needed. ^■ 



Mutant Clones Can Be Identified by 
Hybridization and DNA Sequencing 

Theoretically, half the daughter molecules of a mu- 
tagenesis reaction will be wild-type and half mutant 
In practice, however,, the pxecentage of mutant pla$- 
mids is often much lower. This is due to a variety of 
technical factors^ but the consequence is that methods 
for identifying or enriching mutant clones are vital 
Mutant molecules can be distinguished from wild- type 
if there is gain or loss of a restriction site. Alternatively, 
the oligonucleotide that was originially used to make 
the mutation can be used as a hybridization probe to 
distinguish mutant from wild^type molecules (Figure 
1 1-9). The mutagenic oligonucleotide is radio actively 
labeled with ^^P-ATP and hybridized to DNA from 
bacterial colonies on nitrocellulose filters, as described 
in Chapter 7. If the temperature of the hybridizarion 
is raised in 5 or 10°C increments, a point can usually 
be reached at which the labeled oligonucleotide will 
hybridize only to the mutant molecules (to which it' 
is perfecdy complementary) and not to the wild-type 
molecules, because die hybrid is destabilized by the 
mismatched nucleotides. Plasmid DNA is isolated 
from an EL coli colony that strongly hybridizes to the 
probe. Verification that the desired mutation was made 
is accomplished by sequencing the DNA of this pu- 
tative mutant clone. This technique can identify one 
mutant clone among several hundred wild-type clones. 

Several clever methods enrich for mutant clones so 
that the tedious. task of screening by hybridization is 
not necessary. In one of these techniques, the template 
DNA is biologically marked so that it is destroyed 
after transformation into £ coli 2iTid the mutant strand 
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FIGURE 11-9 

Searching for mutant plasmids using the mutagenic oligonu- 
cleotide as a probe. Colonies (or plaques) resulting from 
transformation by mutagenized plasmids (see Figure 11-8) 
are prepared for colony hybridization on nitrocellulose fil- 
ters using methods described in Chapter 7. The mutagenic 
oligonucleotide is radioactively labeled by phosphorylating 
its 5' end using ^^P-ATP and polynucleotide Icinase. The la- 
beled oligonucleotide is hybridized to the plasmid DNA on 
the nitrocellulose filters. At low temperature, the oligonu- 
cleoride will hybridize to both mutant and wild-type DNAs. 
As the temperature is increased, the mismatched oligonu- 
cleotide hybridized to the wild-type plasmid DNA begins to 
dissociate from the wild-type clones. Eventually a tempera- 
ture is reached at which the mismatched oligomers com- 
pletely dissociate from the wild- type clones but. remain 
hybridized to the mutants. Since the oligonucleotide is ra- 
dioacnvely labeled, the nitrocellulose filter is exposed to 
x-ray film and mutant clones are identified by the presence 
of a strong signal on the autoradiograph. Mutant plasmid 
DNA is then isolated from the corresponding colony on the 
master plate, using the replica filter as a guide. ' 



is preferendally replicated (Figure 1 1-10). In a second 
method, the template strand is enzymatically de- 
stroyed before transformaaon;Both methods can yield 
mutants at a frequency of greater than 50 percent, so 
that plasmid DNA is simply isolated from three or 
fr>ur randomly picked colonies and analyzed by DNA 
sequencing with the expectation that a mutant will be 
found among the DNA selected. 



Oligonucleotide Cassettes Provide a 
Simple Method for Introducing 
Directed Mutations 

We learned earlier that restriction enzyme sites pro- 
vide access to a cloned DNA for mutagenesis. If two 
restriction sites are close together, the intervening 
fragment can be removed and replaced with a synthetic 
double-stranded fragment (a cassette) made from two 
complementary single-stranded oligonucleotides car- 
rying any desired sequence. Often, however, conve- 
nient restriction sites are not available; fortunately, it 
is a simple matter to create them using the oligonu- 
cleotide-directed mutagenesis procedures described in 
the previous sections. Once the sites are in place, any 
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FIGURE 11-10 

Enrichment for oligonucleotide-directed mutants by using a 
uracil-containing template. Single-stranded template DNA is 
prepared in a strain- of K coli that lacks the enzyme uracil 
deglycosidase {ung~), so that it contains several uracil resi- . 
dues in place of thymines. (Although uracil is not usually 
incorporated into DNA, ir is not actually mutagenic and it 
does form a base-pair with adenine.) The mutagenic oligo- 
nucleotide is annealed and primes the synthesis of a strand 
that extends around the template in a reaction using the 
four standard dNTPs (as in Figure 11-8). Following ligation, 
the heteroduplex DNA molecules are introduced into an 
ung"^ strain of £ colL Once in the cell, the wild-type (tem- 
plate) strand is attacked by uracil deglycosidase, which 
causes breaks in the DNA strand, and the DNA strand is 
degraded before it can be replicated. Since the strand con- 
taining the mutagenic oligonucleotide does not contain ura- 
cil, it is not attacked and is replicated normally. When this 
procedure is used, 50 percent or more of colonies contain 
mutant plasmids. 
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number of new mutants can be made by inserting 
synthetic fragments into the plasmid (Figure 11-11), 
just as different cassettes can be inserted into a tape 
player. 

This method of cassette mutagenesis was the basis 
for an elegant experiment that verified a structural 
model for DNA recognition by phage repressors. The 
repressors of the A-like phages 434 and P22 contain 
a helix-turn-helix structure (see Chapter 9) that rec- 
ognizes the operator DNA in the phage genome. It 
was hypothesized that amino acid side chains on one 
face of an a helix in the. repressor protein make se- 
quence-specific contacts with operator DNA: To test 
this hypothesis, a helix swap was performed (Figure 
11-12). Oligonucleotides were synthesized that en- 
coded the amino acids of the helix in the 434 repressor, 
with the five positions thought to contact DNA 
changed to those found in the P22 repressor. This 
synthetic fragment was swapped for the natural frag- 
ment in the 434 gene. The resulting hybrid protein 
gained the recognition specificity of the P22 repressor, 
demonstrating that this helix indeed contacts the 
DNA. 

Cassette mutagenesis with degenerate oligonucleo- 
tides can be used to create a large collection of random 
mutations in a single experiment. This method was 
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used to itudy the structure of the glucoGorticoid re- 
sponse element (GRE), an enhancer-, sequence that 
acDvates a family of genes in response to certain steroid 
hormones. The element had been mapped by deletion 
mutagenesis to a 30-bp region in a glucocorticoid- 
regulated gene. To define the sequence required for 



FIGURE 11-11 

Mutagenesis by cassette replacement. Plasmid DNA is 
cleaved with restriction enzymes EcoRl and HmdUI which 
T "^"t^It^" ^^"'^ sequence to be mutated. The small 
Cleaved DNA fragment contaimng a portion of the wild- 
type sequence is removed, and a DNA fragment (cassette)' 
containing the desired mutation is ligated into the plasmid. 
This mutant DNA fragment is composed of two comple- 
mentary synthenc oligonucleotides that have EcoRI and 
//Win sticky ends when annealed. Because there is no hei- 
eroduplex intermediate— the mutant cassette is simply 
swapped for the wild-type fragment— the recombinant plas- 
nuds are all murants. A mutant cassette can be composed of 
degenerate oligonucleotides (see Chapter 7), resulting in a 
library of mutant plasmids containing different sequences 



GRE fiincnon precisely, single point mutations 
throughout the 30-bp region were generated ' and 
tested m cells for inducibility by glucocorticQid hor- 
mone. Two complementary oligonucleotides were 
synthesized that carried the 30-bp GRE, but synthesis 
was performed under conditions in which incorrect 
nucleotides were incorporated at a low frequency (Fig- 
ure 11-13). These "dqped" oligonucleotides (that is, 
oligonucleotides produced by. doping; see Figure 
11-13) were annealed and inserted as a cassette into 
a promoter diat lacked a GRE. Using this method, 
most single-nucleotide substinitions at the 30 positions 
were obtained. Such a collection of mutants would 
have been unthinkable before oligonucleotides revo- 
lutionized in vitro mutagenesis. 



Gene Synthesis Facilitates Production 
of Normal and Mutant Proteins 

The oligoiiucleotide-directed mutagenesis methods 
we have described use a single oligonucleotide or a 
pair of complementary oligonucleotides to insert mu- 
tant sequences into an otherwise natural DNA frag- 
"^^""^ Msh. ^ji^^jncrezsmg, availability .of l onger 
otigoriucleoudes, it is now feasible to assemble an e n- 
"re gene from synthetic umts. This is done by s^^ 
th'esizing a set of oligonucleotides, typically 40 to' 8 0 
. .nucleotides in length, that can be annealed and hgated 
in_vitro to assemble an entire double-stranded UNA 
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FIGURE 11-12 

The helix swap experiment Amino acids in the phage 434 repressor protein believed re- 
sponsible for recognition of the 434 operator were changed by cassette mutagenesis (Figure 
U-U) of 434 DNA to the amino acids believed to perform the same function in an analo- 
gous region of phage P22 repressor protein, (a) Expression in K coli of the 434 repressor 
protein (left), with an enlargement of the site believed to bind the 434 operator; (right) the 
corresponding section of the P22 repressor protein, (b) A cassette was synthesized resembling 
the 434 domain, but with P22-type substitutions at positions thought to be essential for re- 
cognizing .P2 2 operator DNA. This was ligated into the digested 434 plasmid, and the re- 
combinant vector was introduced into E. coli to produce the hybrid protein, which then 
recognized P22 operator DNA bur not the 434 operator. 
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FIGURE 11-13 

Cassecre mutagenesis using doped oligonucleotides to gener- 
ate numerous mutants in a siT,o\^ experiment An olic^onu- 
c eotide cassette encoding the glucocorticoid response 
element (ORE) was synthesized by a DNA synthesis ma- 
chme. Synthesis was done under conditions in which each 
' bottle contaimng a particular nucleoude precursor was 
• contammated" (doped) with small amounts of the other 
three precursors. In the example above, die DNA synthes- 
izer was instructed .to make an oligonucleodde with die se- 
quence GGTTACAAACT. Thus, when a nucleodde 
precursor is called for— a C for example— the machine 
adds aii aliquot of die solurion from the C bottle and a C 
base is coupled to die end of most of the oligonucleodde 
chains. However, because die C botde contains a small 
- amount of A, G and T, an incorrect base is somedmes 
added instead. Since the concentration of C is roughly 30 
omes that of A, G and T, an incorrect base will be added to 
about 1 out of 30 molecules. This results in a doped collec- 
non of ohgonucleoddes, which actually consists of many 
diifereat sequences, some wild-type and some with subsdtu- 
nons.-The level of contamination was adjusted to favor syn- 
diesis of oligonucleotides with only one subsdtudon, but 
because subsaradons occur randomly, some molecules in 
die GoUecnon had none and odiers had two or more Cas- 
settes were formed by anneahng complementary doped oli- 
gonucleoades and iigated into a vector. Plasmid DNA was 
isolated from 546 individual K coli trahsformants and ana- 
lyzed by sequencing. Of these, 224 were wild-type, 218 con- 
tained one subsdtudpn (for die 30 bases, of interest, 74 of 
the 90 possible smgle substitudons were recovered), and the 
rest contained two. or more. 



• IRQ|^fe(^iairg,.lldL4). In gene synthesis, the ex- 
RgggjjJ^gr HgQPtal controroyer the sequence of the *" 
gene. It Can be wild-tv pe ox mutanf in anv wav re- 
quired. Because most amino acids are encoded by 
multiple triplet codons. genes encoding wild-type pro- 
teins can be constructed using different codons. Co- 
dons can be chosen to place unique restriction sites 
throughout die sequence so diat mutant cassettes can 
be easily swapped in. This was done with die bacterial 
rhodopsin gene. Replacing a fragment of the synthetic 
gene with a new synthetic fragment identified die 
ammo acid rfiat is linked to the photon-absorbing 
chromophore diat initiates photosynthesis. Other frag- 
ments can be exchanged as cassettes to study other 
important structural features of the protein. , - 

Codons can also be changed by gene syndiesis to 
allow production of proteins at high levels in other 
organisms. Studies of die biochemistry of the Fos pro- - 
tein, encoded by a cellular protooncogene in animal 
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FIGURE 11-14 

Gene synthesis by ligation of complementary oligonucleo- 
tides. To synthesize a gene that encodes a protein of inter- 
est, a set of overlapping complementary oligonucleotides are 
designed that can be combined to form a Bouble-stranded 
DNA molecule that encodes the entire protein. The oligo- 
nucleotides are mixed together, heated at 90°C for a few 
minutes to denature the strands, and then cooled slowly to 
room' temperature- During this period the oligonucleotides 
anneal through complementary base pairs. The oligonucleo- 
tides are designed so that each one anneals to two adjacent 
oligonucleotides from the opposite strand, bridging them. 
Generally, oligonucleotides ranging in length from 40 to 80 
nucleotides are used in gene synthesis. The annealed oli- 
gonucleotides are covalently linked by DNA ligase, produc- 
ing two contiguous DNA strands. This synthetic gene is 
usually purified from a gel before ligation into a vector. The 
resultant recombinant plasmid is obtained following Trans- 
formadon into E. coli and is sequenced to check that the 
correct sequence was synthesized The sequence of the syn- 
thetic gene can be designed to place restriction sites at con- 
venient locations for cassette mutagenesis. 



cells (Chapter 18), have been severely hampered by 
the inability to produce the protein in E. colu This 
problem was finally solved by synthesizing a portion 
of the fas gene entirely from oligonucleotides, chang- 
ing natural codons to the codons used most effi- 
ciently in K coll Insertion of this syntheric gene into 
an K coli expression vector allowed for the first time 
the producdon of large quandties of active Fos protein. 
The gene was also designed with several unique re- 
striction sites so that efficient cassette mutagenesis can 
now be coupled to the biochemical assays for Fos 
function. 



The PGR Can Be Used to Construct 
Genes Encoding Chimeric Proteins 

The ease with which mutarions can be made in a 
protein coding sequence has revolutionized the study 
of protein function. A functional domain can be iden- 
tified by making a series of mutant proteins, then 
testing which substimtions cause a change in function. 
However, it is not often easy to decide where to make 
^ a mucationL In the example of the helix swap exper- 
iment (Figure 11-12), the domain that bound DNA 
had been previously identified- And the design of the 
experiment was guided by having a model for the 
three-dimensional structure of the repressor protein. 



DNA sequence or 
amino acid sequence 




Compiementory 
>- s/nrfietic 
oligonucleotides 



slowly cool to room temperature 



DNA ligase 



\^k^>i•?^^^s>^^^h^(^^^'^i^K'^ Isolate double- stranded 
DNA molecule 



DNA ligase 



Trohscriptipn al 
terminator 




Unique restriction 
sites for cassette 
mutagenesis 



Express recombinant 
protein in £. co/i 



However, for most proteins, little structural infor- 
mation is available. Identifying a functional domain — 
for example,, a region of the protein that may interact 
with another protein — is difficult to do by inspecting 
the primary amino acid sequence. A simple strategy 
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. FIGURE 11-15 

. Construction of a chimeric antibody heavy-chain-encoding gene by ''sticky feet -directed" 
mutagenesis. Antibodies containing a yZb heavy chain are known to participate in comple- 
ment.dependent cell lysis, whereas antibodies containing yl heavy chains do noL In order to 
idenufy which domain of the y2b heavy chain is responsible for this property, an antibody 
contammg a chimeric heavy chain was produced. To construct a gene encoding die chimeric 
heavy chain, i 400-bp fragment encoding the Ch2 domain from a y\ heavy chain was're- 
placed with die homologous segment from a y2b gene. Since there were no convenient re- 
striction sites at the ends of the Ch2 segments, the 400-nucleodde-Iong y2b DNA was 
prepared by PGR. The PGR primers were complementary to the ends of die y2b DNA but 
contained "additional nucleotides (the sticky feet) that were complementary to y\ DNA at the 
boundaries of the yl Ch2 domain. The strands of the PCR-generated fragment were sepa- 
rated by heating, then one strand'was used as the primer in a mutagenesis experiment using 
a uracil-contaimng single-stranded yl DNA template by the method shown in Figure U-10 
The resulting chimeric heavy-chain gene was coexpressed with a light chain gene in mam- 
malian cells to form an antibody that now activated complement. Since only the C„2 domain 
came from the y2b heavy-chain, this result demonstrated that the y2b C„2 domain contains 
the information necessary to activate complement-dependent cell lysis. Sticky feet-directed 
mutagenesis provided a simple means for constructing this complicated gene. 
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that helps to narrow down important amino acids in 
a protein is the analysis of chimeras between related 
proteins. We have previously discussed the use of 
computer programs to identify related proteins by . 
comparison of their amino acid sequences (Chapter 
8). Chimeric proteins are constructed by replacing a 
segment of one protein with the homologous segment 
from another protein. Although the two proteins have 
functional differences, their sequence similarity often 
indicates that they share a common overall structure. 
A striking example of this was in the analysis of human 
gro wth hormone (hGH). A series of chimeric proteins 
were made in which most of the amino acids were 
derived from hGH but which contained segments from 
related hormones, such as human prolactin. Using this 
strategy, regions of hGH that interact with the hGH 
receptor were identified. In Chapter 17, we' will see 
how functional regions of a receptor which spans the 
membrane seven times were identified by the study 
of chimeras. 

The 434/P22 repressor (Figure 11-12) and hGH 
chimeras were constructed by ligation of short oii- ^ 
gonucleoride cassettes into the coding sequence. A 
different strategy (Figure 11-15) was used to prepare 
a chimeric antibody in which a 400-bp segment from 
a yl heavy-chain gene was replaced by the homolo- 
gous segment from a y2b gene. A 400-bp DNA frag- 
ment was generated by PCR that encoded the new 
sequence to be inserted and two 30-base "sticky feet" 
on each end. The double-stranded PCR fragment was 
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heated to denature the two strands, and then one of 
the single-stranded molecules was utilized in a primer- 
extension experiment (as in Figure 11-8). Had the 
gene synthesis method been employed, construction 
of the chimeric gene would have required twenty 40- 
nucleotide-long oligomers. Instead, the sticky feet 
method used only two oligonucleotide primers for 
PCRi 

Mutagenesis Is the Gateway to Gene 
Function and Protein Engineering 

It would be difficult to overestimate the importance 
of in vitro mutagenesis techniques to biology and bio- 
technology. The harnessing of enzymes that operate 
on DNA and the refinement of oligonucleotide syn- 
thesis have made changing gene sequences an almost 
trivial task. And the ability to operate on DNA lets 
us also change thestructure of the products of genes — 
RNA and, most importantly, proteins. Thus, the im- 
pact of this technology is twofold. It has revolutionized 
how research is done in molecular biology by creating 
the entirely new concept of "reverse genetics" — 
changing gene sequence first, then examining gene 
function. And it opens the door to sophisticated protein 
engineering (see Chapter 23), the ability to make 
changes in natural gene products that make them do 
their jobs better. The impact of protein engineering 
on medicine and industry will be substantial. 
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Recombinant DNA 
in Medicine 
and Industry 



As soon as the first successful cloning experiments were rejported in 1973, 
applications for this powerful technology quickly followed. The signifi- 
cance of being able to produce large quantities of human proteins that were 
normally available in exceedingly small amounts, if at all, was not lost on 
scientists, physicians, and businessmen alike. In 1976 biotechnology became a 
reality as the methodologies for DNA cloning, oligonucleotide synthesis, and 
gene expression converged in a single experiment, in which a ^human protein 
was expressed from recombinant DNA for the first time. The protein was 
somatostatin, a 14 amino acid peptide neurotransmitter. The gene encoding 
somatostatin was not the natural gene but was synthesized- chemically and cloned 
into a plasmid vector for expression in K colt. Soon after followed the successfiil 
expression of human insulin for the treatment of diabetes, the first commercial 
product of the biotechnology industry. Instead of insulin extracted from the 
pancreases of pigs and cows, diabetics could now receive insulin identical to 
that normally produced by humans. 

The ability to achieve such feats relied on the successes in all areas of molecular 
biology, including oligonucleotide synthesis, isolation of enzymes that cleave 
and join DNA, characterization of bacterial plasmids, and an understanding of 
gene expression. These methods have, of course, revolutionized research in 
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biology and medicine, but what is equally important, 
they have spawned an entirely new industry, one de- 
voted to the cloning and production of proteins of 
importance to both medicine and industry. Today, 
proteins are produced through recombinant DNA 
technology for treatment of numerous diseases-^can- 
cer, allergies, autoimmune disease, neurological dis- 
orders, heart attacks, blood disorders, infections, 
wounds, and genetic diseases— as well as for more 
prosaic tasks, such as use in laundry detergents and 
food production- In addition, entirely new approaches 
to drug design have emerged from recombinant DNA 
technology, as scientists have gained the ability to 
tinker with natural proteins to improve their fiincrion 
and to change them in subtle and useRiI ways. 



Expression Systems Are Developed to 
Produce Recombinant Proteins 

Cloning the gene or cDNA encoding a particular pro- 
tern is only the first of many steps needed to produce 
a recombinant protein for medical or industrial use. 
The next step is to put the gene into a host cell for 
production. The development of expression systems 
has been an important research area in both industrial 
• and academic laboratories. The most popular expres- 
sion systems are the bacteria K coli and Bacillus sukilis, 
yeast, and cultured insect and mammalian ceils. We 
have learned in earlier chapters about the development 
of vectors and DNA transformation methods for these 
organisms. Here we will discuss the issues that are 
important for protein production. The choice of which 
cell is used depends on the project goals and on the 
properties of the protein to be produced. 

Bacterial cells offer simplicity, short generation 
times, and large yields of product with Jew costs. And, 
especially with B. subtilis^ the cells can be induced to' 
secrete the product into the culture medium, thus 
greatly simplifying the task of purification. But expres- 
sion in prokaryotic cells has several drawbacks. Al- 
though some proteins are expressed to'high levels 
(greater than 10 percent of the mass of all bacterial 
proteins), they often fail to fold properly and hence 
■form insoluble inclusion bodies. Protein extracted from 
these inclusion bodies is often biologically inactive. 



- Small proteins can sometimes be refolded into their 
active forms, but larger proteins usually cannot. A 
second problem is that foreign proteins are somedmes 
toxic to bacteria, so cell cultures producing the protein 
cannot be grown to high densides. This problem can 
often be circumvented by using an inducible promoter 
- that is turned on to begin transcripdon of the gene 
for the foreign protein only after the culture has been 
grown. Third, bacterial cells lack. enzymes that are 
present in eukaryodc cells and add posttransladonal 
modifications, such as phosphates and sugars, to pro- 
teins. These modifications are often required for 
proper fiincrioning of proteins. Researchers are ad- 
dressing this problem by purifying die eukaryodc en- 
zymes that carry out these modificadons and using 
these enzymes to add the needed modificadons to 
bacterially expressed proteins. 

Yeast has been used for centuries by brewers and 
bakers, and now it toils for biotechnologists as well. 
As dismissed in Chapter 13, yeast is a simple eukaryote ' 
that resembles mammalian cells in many ways but can 
be grown as quickly and cheaply as bacteria can. Yeast 
perform many of the posttranslational modifications 
found on human proteins and can be induced to secrete 
certain proteins into the growth medium for harvest- 
ing. A disadvantage of yeast is the presence of acdve 
proteases that degrade foreign proteins, thereby re- 
ducing die yield of product. Researchers are dealing 
with this problem, however, by construcdng yeast 
strains in which the protease genes have been deleted. 

Expression of heterologous proteins in insect cells 
by baculovirus vectors (as previously described in Fig- 
ure 12-12) is a relatively new approach. The main 
advantages are high-level expression, correct folding, 
and posttransladonal modifications similar to those in 
mammalian cells. A vaccine for the AIDS virus has 
been prepared by producing one of the HIV glyco- 
proteins with this system. Although the cost of cul- 
niring insect cells is currendy more than that for 
culturing bacteria and yeast, it is less than that for 
culniring mammalian cells. 

Despite the significant advantages of producing hu- 
man proteins in heterologous host cells, in some cases 
the best place to produce a mammalian protein is in' 
mammalian cells. Great improvements have been 
made to promoters, vectors, transformation protocols, 
and host cell systems. Transient expression in mam- 
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malian cells (described in Figure 12-4) is often used 
for checking the jfunction of a newly cloned gene and 
as a quick method for assessing the function of en- 
gineered proteins. The extracellular domains of cell- 
surface receptors (Chapter 17) have been engineered 
for secretion from cells by introducing a stop codon 
into the gene before the transmembrane domain se- 
quence. These soluble receptors are valuable reagents 
for studying ligand binding in vitro and for screening 
for receptor agonists or antagonists, and they may 
eventually be used as therapeutics themselves. Al- 
though transient systems yield enough protein for lab- 
oratory experiments, stably integrated amplified genes 
in mammalian cells are used for the large-scale pro- 
duction of proteins such as tissue plasminogen acti- 
vator, which we describe later. 



Insulin Is the First Recombinant Drug 
Licensed for Human Use 

The first licensed drug produced through genetic en- 
gineering was human insulin. An important hormone 
that regulates sugar metabolism, insulin is produced 
by a small number of cells in the pancreas and secreted 
into the bloodstream. An inability to produce insulin 
results in diabetes, but daily injections of insulin are 
sufficient to reverse or at least allay the debilitating 
effects of the disease. Prior to production of the re- 
combinant molecule, insulin for treatment of diabetes 
was obtained from the pancreases of pigs and cows. 
Although this insulin is biologically active in humans, 
the amino acid sequences are not identical to that of 
the human rnolecule. Thus, some patients produced 
antibodies against injected insulin, occasionally re- 
sulting in serious immune reactions. Because recom- 
binant human insulin is identical to the natural 
product, immunogenicity should not be. a problem. 

In mammals, insulin is expressed as a single-chain 
prepro-hormone, which is secreted through the plasma 
membrane. A prepro-hormone contains extra amino 
acids not present in the mature hormone. Amino- 
terminal amino acids form the pre sequence and target 
the expressed protein for secretion. The pro sequence 
is a stretch of amino acids in the middle of the hormone 
sequence that is important for folding the polypeptide 



chain into the correct structure. During secretion, 
these extra amino acids are cleaved from the prepro- 
hormone by cellular proteases to release the mature 
insulin molecule, consisting of two short polypeptide 
chains, A and B, linked by two disulfide bonds. The 
principal challenge in the production of recombinant 
insulin was getting insulin assembled into this mature 
form. The initial approach was to construct synthetic 
genes from oligonucleotides that separately encoded 
the A and B chains. These were individually inserted- 
into the K coli gene encoding ^-galactosidase, so the 
bacteria produced large fusion proteins that had the 
insulin sequences tacked onto the end of the ^- 
galactosidase enzyme (Figure 23-1). These large pro- 
teins were purified from bacterial extracts, and the 
insulin chains were released by treatment with cyano- 
gen bromide, a chemical that cleaves peptide bonds 
following methionine residues. Because a methionine 
codon had been inserted at the boundaries between 
j?-galactosidase and the insulin chains in the fusion 
proteins, cyanogen bromide treatment clipped intact 
insulin chains off the fusion proteins. These were pu- 
rified, mixed, and reconstituted into an active insulin 
molecule. This approach was refined by producing a 
single ^-galactosidase— insulin fusion protein, which 
could be cleaved in a single step to release mature 
insulin. A similar method is now in use for the com- 
mercial production of recombinant insulin. 



Recombinant Human Growth 
Hormone is Produced in 
Bacteria by Two Methods 

Growth hormone is a 191 amino acid protein that is 
produced in the pituitary gland and regulates growth 
and development. Children born with growth hor- 
mone deficiency — hypopituitary dwarfs — never 
achieve normal stature. Regular injections of growth 
hormone stimulate the growth of these children so 
that they reach near-normal heights. Unlike the sit- 
uation with insulin, animal-derived growth hormones 
are ineffecdve. Only the -human protein works, and 
for many years it was painstakingly, extracted from the 
pituitaries of human cadavers. One unforeseen and 
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unfortunate consequence of growth hormone treat- 
ment, however, was the infection of a number of chil- 
dren with a fatal virus from one of the cadavers. 
Production of recombinant human growth hormone 
(hGH) would clearly provide a safe, reliable, and plen- 
tiful source of this drug. 

The initial production of hGH was achieved by 
constructing a hybrid gene from the natural hGH 
cDNA and synthetic oligonucleotides that encoded 
the amino terminus of the mature form of the protein 
(Figure 23-2a). This coding sequence was ligated into 



FIGURE 23-1 

Expression of human insulin in £ coli Recombinant insulin 
was first made by expressing the A and B chains separately 
then refolding them into a mature insulin molecule. A DNA 
fragment encoding each insulin chain was made by anneal- 
mg two complementary oligonucleotides that had been • 
chemically synthesized Each fragment was ligated into a 
bacterial expression vector so diat, when translated the in- 
suhn chain would be fbsed to the carboxy terminus of the 
enzyme ;?-galactosidase (^-gal). The expression vectors 
were transformed into £ coli, and the ^-gal-insulin fUsion 
proteins accumulated inside the bacterial cells. The cells 
were harvested, and each ;?-gal-insuIin SfU^ion protein was 
purified. The msuhn-coding DNA was synthesized so' that it 
started with a mediionine codon. This setup provided a way 
to cleave off the ^-gal paa from die insulin polypep.dde 
Treatment of the ftision protein widi die chemical cyanogen 
bromide (CNBr) results in cleavage of peptide bonds after 
all methionmes. In diis way, the natural insulin peptides 
were obtained. Because ^-gal contains other mediionine res- 
idues, CNBr treatment cleaved it into many small peptides 
The msuhn chains were not cleaved further because they 
did not contain internal methionines. The A and B chains 
were purified and dien mixed together to form active re- 
combinant insulin. 



a plasniid adjacent to a bacterial promoter. Like insulin, 
hGH is normally produced as a larger precursor pro- 
tein containing an amino-terminal signal sequence. 
Because die human signal sequence would not be rec- 
ognized by the bacterial secretion machinery, the 5' 
end of the cDNA was reengineered with a synthetic 
DNA sequence enabling die bacteria to produce a 
nearly normal version of the mature human protein. 

The first hGH expression vectors'directed the pro- 
duction of the protein inside the cell. Purification re- 
quired many steps to separate hGH from die thousands 
of intracellular bacterial proteins. Another way to pro- 
duce the protein in bacteria is to engineer die protein 
so it is secreted. This can be done by linking the coding 
sequence for the desired protein to a signal sequence 
from a secreted bacterial protein, thus forming a pre- 
hormone (Figure 23-2b). Human growth hormone is 
produced by the bacteria and dien secreted with die 
concomitant removal of the signal peptide by a bac-^ 
terial protease. Secretion into the periplasm, where 
there are fewer proteins than inside the cell' makes 
purification simpler. The only difference between the 
secreted hGH and that produced intracellularly is the 
presence of an amino-terminal methionine on the in- 
tracellularly expressed molecule. Because the secreted 
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FIGURE 23-2 

Bacrerial production of human growth hormone (hGH). (a) An expression vector was con- 
structed for intracellular producrion of hGH. The coding sequence was constructed by iso- 
lating from the cDNA a DNA fragment that encoded amino acids 24-191 and hgating this 
to a synthetic oligo nucleotide fragment that encoded amino acids 1—24. Following introduc- 
tion of the expression vector into bacterial cells, recombinant hGH was produced inside the . 
cells. The expressed protein behaved just like natural human growth hormone but contained 
the initiator methionine at the amino terminus, (b) A protein can be produced in bacteria 
without this extra methionine by targeting it for secretion. To do this, a DNA fragment 
encoding a bacterial signal sequence, which specifies secretion of a bacterial protein, was 
placed in front of the hGH coding sequence. Upon introduction of this vector into bacteria, 
hGH is produced, and the signal sequence targets the protein for secretion. The protein 
accumulates in zhk periplasmic j-^ar^ between the inner and outer bacterial membranes and 
can be released by hypotonic disruption of the outer membrane. In contrast to the intracellu- 
lar form of hGH, the protein produced by this procedure does not contain an initiator me- 
thionine, since a periplasmic protease cleaved off the signal sequence. 
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-form lacks this methionine, it is called met-less hGH. 
Bacterially expressed hGH has been administered to 
thousands of growth hormone-deficient children, who 
hav^ benefited greatly from this recombinant drug. 



A Hepatitis B Virus Vaccine Is - 
Produced in Yeast by Expression 
of a Viral Surface Antigen 

One of the successes of modern medicine is the de- 
velopment and implementation of vaccines against in- 
fectious diseases. Prior to the advent of recombinant 
DNA technology, two types of vaccines were used. 
Inactivated vaccines are chemically killed derivatives 
of the actual infectious agent. Attenuated vaccines are 
hve viruses or bacteria altered so that they no longer 
multiply in the inoculated organism. Both types of 
vaccines work by presenting surface proteins (anti- 
gens) to B and T lymphocytes, which become primed 
to respond rapidly should the organism actually be- 
come infected, usually destroying the infectious agent 
before any damage is done (Chapter 16). However:, 
these types of vaccines are potentially dangerous be- 
cause they can be contaminated with infectious or- 
ganisms. For example, a small number of children each 
year contract polio from their polio vaccinations. Thus, 
one of the most promising applications of recombinant 
DNA technology is the production of suhunit vaccines, 
consisting solely of the surface protein to which the 
immune system responds.. With a subunit vaccine, 
there is no risk of infection. ' 

The first successful subunit vaccine was produced 
for hepatitis B virus (HBV), which infects the liver 
and causes liver damage and, in some cases, cancer. 
The virus particle is coated with a surface antigen, 
HBsAg, and infected patients carry large aggregates 
of this protein in their blood. Early experiments sug- 
gested that these aggregates' would make a potent 
vaccine, but how could they be produced in quantities 
sufficient to vaccinate large populations against HBV.? 
With the cloning of the HBV genome, the possibility 
of a subunit vaccine could be explored. Initial attempts 
to produce the HBsAg protein in E. coli failed, so 
researchers turned to yeast. The HBsAg gene was 
mserred into a high-copy yeast expression vector (Fig- 



ure 1 3-3) and engineered, in this case, so that it would 
not be secreted (Figure 23-3). Yeast transformed with 
this plasmid produced large quantities of the viral 
protein (about 1-2 percent of the total yeast protein). 
By growing the yeast in large fermentors, it was pos- 
sible to produce 50-100 mg of the protein per liter 
- of culture. This recombinant protein closely resembled 
the natural viral protein; it even formed aggregates 
with properties sirhilar to those of the immunogenic 
aggregates found in infected patients. The yeast pro- 
tein is now used commercially to vaccinate people 
against HBV infecdon. 

Vaccines against many human and animal patho- 
gens are currendy in various stages of development 
Recombinant DNA technology has provided a safe 
means to work with and to inoculate children and 
adults with only noninfecuous parts of infectious 
agents. In Chapter 25, we will discuss various strategies 
for the development of a vaccine against the AIDS 
virus. 



Complex Human Proteins Are 
Produced by Large-Scale 
Mammalian Cell Culrure 

Most of the recombinant proteins we have discussed 
thus far in this chapter are relatively small and simple 
m.both structure and function. Other proteins of med- 
ical interest are considerably more complicated in 
structure and function, and biologically active proteins 
have proved difficult to produce in bacteria and yeast 
In these cases, biotechnology companies have resorted 
to using mammalian cells for protein production. 
Mammalian cells are finicky and expensive to grow, 
but they can be counted on to produce correcdy mod-, 
ified, fiilly active proteins. Thus, much effort in the 
biotechnology industry has been devoted to setting up 
fermentor systems for large-scale culture of mam- 
malian cells. 

The fitst drug to be produced commercially by 
mammalian cell culture was tissue plasminogen activator 
or tPA, which is administered to heart attack victims. 
Tissue plasminogen activator is a protease, an enzyme 
that cleaves other proteins. It works by clipping plas- 
minogen, an inactive precursor protein, to form plasmin, 
itself a potent protease that degrades fibrin, the protein 
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FIGURE 23-3 

Production of a subunit vaccine in yeast. Hepatitis B virus 
(HBV) is encoded by a small 3^2-kb genome that has been 
cloned and sequenced. Both the whole virus and a smaller 
HBsAg (HBV surface antigen) particle are found in the 
blood of infected patients. To prepare a vaccine against 
HBVi which has been difficult to propagate in culture, the 
HBsAg gene was cloned into a vector for expression in the 
yeast Saccbaromyccs cercuisiac. Transcription occurs from the 
strong promoter from the gene encoding alcohol dehydro- 
genase I. A transcription terminator was placed downstream. 
The vector contains replication origins and markers for both 
bacteria and yeasL Yeast transformed with this plasmid can 
be grown to high cell densities in fermentors. This process 
results in the accumulation of large amounts of HBsAg 
protein, which upon purification was found to aggregate 
into particles about 20 nanometers in diameter, resembling 
the particles found iii HBV-infected patients. 



that forms blood clots. Rapid administration of a plas- 
minogen activator after a heart attack dissolves the 
life- threatening clots that lead to irreversible darnage 
of heart; muscle. Tissue plasminogen activator is com- 
mercially produced from a mammalian cell -line car- 
rying a stably integrated, highly amplified expression 
vector (Figure 23-4). 

Another protein being produced by mammalian cell 
culture is Factor VIII, a protein required for normal 
clotting of the blood. Genetic defects in Factor VIII 
production are responsible for hemophilia. For many 
years, hemophiliacs have been treated with Factor VIII 
purified from human blood. With- the contamination 
of the human blood supply by the AIDS virus, how- 
ever, thousands of hemophiliacs became infected and 
hundreds died from AIDS. The Factor VIII cDNA 
had already been cloned before scientists found that 
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the blood supply was contaminated with the AIDS 
virus. Recognition of the need for a safer source of 
Factor VIII accelerated efforts already under way to 
produce the protein using recombinant DNA methods. 
Like tPA, Factor VIII is a large and complex protein 
and can only be efficiently produced in mammalian 
cell culture. But the availability of recombinant protein 
will spare fumre generations of hemophiliacs from 
infectious agents that contaminate the blood supply. 



Monoclonal Antibodies Function 
as "Magic Bullets" 

We have discussed the use of biotechnology to pro- 
duce novel vaccines that elicit antibody production 
by the body's immune system. As we learned in Chap- 
ter 16, antibodies are exquisitely selective proteins that 
can bind to a single target among millions of irrelevant 
sites. Researchers have long dreamed of harnessing the 
specificity of antibodies for a variety of uses that re- 
quire the targeting of drugs and other treatments to 
particular sites in the body. It is this use of antibodies 
as targeting devices that led to the concept of the 
"magic bullet," a treatment that could effectively seek 
and destroy nimor cells and infectious agents wherever 
they resided. 

> The major limitation in the therapeutic use of an- 
tibodies is producing a useful antibody in large quan- 
tities. Initially, researchers screened myelomas, which 
are antibody-secreting rumors, for the production of 



FIGURE .23-4 

Prodticrion of tissue plasminogen, activator (tPA) by mam- 
mahan cell culture. The cloned cDNA for human tPA was 
hgated mto an expression vector that contained a strong 
promoter and terminator. The vector was stably transfected 
mto a mammalian cell line. The initial transformants 
secreted tPA into the culture medium, but the level of 
expression was very low. Cell lines that expressed tPA to 
high levels were obtained using methotrexate treatment 
which selects for cells that have amplified the dhfr gene' 
resident in die vector together with the linked tPA expres- 
sion cassette (Chapter 12). High-expressing lines are grown 
in. large fermentors and recombinant tPA is purified from 
the culture medium. 
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useful antibodies. But they lacked a means to program 
a myeloma to produce an antibody to their specifi- 
cations. This situation changed dramatically with the 
development of monoclonal antibody technology. The 
procedure for producing monoclonal antibodies, or 
MAbs, is shown in Figure 23.-5. First, a mouse or rat 



FIGURE 23-5 

Production of a monoclonal antibody (MAb). A mouse is 
inoculated with an antigen (Ag) of interest. This stimulates 
the proliferation of lymphocytes expressing antibodies 
against the antigen. Lymphocytes are taken from the spleen 
and fused to myeloma cells by treatment with polyethylene" 
glycol. Hybrid cells are selected by growth in HAT medium 
(Chapter 12). The myeloma cells lack the enzyme HPRT 
and thus die in this medium unless they become fused with 
a lymphocyte, which expresses the missing enzyme. Unfiised 
lymphocyte cells soon die off as well, because they do not 
grow for long in culture. Individual hybrid cells are 
transferred to the wells of a microtiter dish and cultured 
for several days. Aliquots of the culture fluids are removed 
and tested for the presence of antibody (Ab) that binds the 
antigen. Cells that test positive are cultured for monoclonal 
antibody production. Antibody-producing cell lines are 
stored frozen in liquid nitrogen (this process is called cell 
banking. Aliquots can be thawed out and cultured as needed. 

is inoculated with the antigen to which an antibody 
is desired. After the animal mounts an immune re- 
sponse to the antigen, its spleen, which houses anti- 
body-producing cells (lymphocytes), is removed, and 
' the spleen cells are fused en masse to a specialized 
myeloma cell line that no longer produces an antibody 
of its own. The resulting fused cells, or bybridomas, 
retain properties of both parents. They grow contin- 
uously and rapidly in culture like the myeloma cell, 
yet they produce antibodies specified by the lympho- 
cyte from the immunized animal. Hundreds of hy- 
bridomas can be produced from a single fusion 
experiment, and they are systematically screened to 
identify those producing large amounts of a desired 
antibody. Once identified, this antibody is available in 
limitless quantities. Monoclonal antibodies are already 
widely used for the diagnosis of infections and cancer 
and for the imaging of tumors for radiotherapy. And 
investigations into their use in the direct treatment of 
cancer, inflammation, and immune disorders is on the 
rise. 

Human Antibodies That Recognize 
Specific Antigens Can Be Directly 
Cloned and Selected 

One new application of monoclonal antibody tech- 
nology is the generation of abzymes, antibodies that 
behave like enzymes to catalyze a chemical reaction. 
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FiGURE 23-6 

Direcr cloning of antibody cDNAs by PGR. To engineer an 
antibody, the amino acid sequence of the variable domain 
needs to be determined. This could be done by sequencing 
a purified preparation of the heavy- (H) and light- (L) chain 
proteins, but a simpler method is to deduce the sequence 
from the cloned cDNA. In the -past, a cDNA library was 
prepared from hybridoma mRNA and screened with probes 
from the constant regions of the H and L chain genes. A 
simpler method has been developed that uses the PGR. 
From a comparison of a large number of antibody 
sequences, amino acids frequently found at rhe amino 
termini of antibodies were identified. From this information, 
a set of degenerate PGR primers was designed that cor- 
respond to all the possible sequences in this region. Because 
the amino acids in the constant domains of different anti- 
bodies are nearly identical, only one PGR primer is needed 
for the y end of each H and L chain sequence. To direcdy 
clone the annbody cDMAs, cDNA is prepared by treadng- 
hybridoma mRNA with reverse transcriptase, mixed with a 
pair of PGR primers (in this case, for amplifying the 
heavy chain sequences), and subjected to PGR. Without 
knowledge of the amino terminus of the anubody chain, a 
PGR had to be set up with each of the different 5' primers 
until an amplified DNA fragment was obtained. The pro- 
cess can be simplified if the sequence of the first six or 
seven amino acids of the antibody can be determined; this 
is sufficient to design a single 5' PGR primer. ' 



Enzymes catalyze reactions by stabilizing a chemical 
structure intermediate between the substrate and 
product, termed the transition state. Thus, if monoclonal 
antibodies could be made to a transition state ana- 
logue — a molecule resembling the transition state of 
a chemical reaction — then some of these antibodies 
might have catalytic activity. The ability to produce, 
custom-designed catalysts would be very valuable, es- 
pecially to the chemical and pharmaceuticalindustries. 

Initial attempts to produce catalytic antibodies in- 
dicated that they were exceedingly rare and often not 
found among the hybridomas produced by conven- 
tional monoclonal antibody technology. An excellent 
fusion might produce several hundred different an- 
tibodies, but the entire repertoire of antibodies that 
can be produced by the immune system is perhaps 
100 milhon. How can the entire repertoire be tapped.^ 
One strategy, that shows promise is to bypass, the 
inefficient fusion step in hybridoma production and 
directly clone antibody cDNAs from the lymphocytes 
of immunized mice (Figures 23-6 and 23-7). Inves- 
tigators inoculated a mouse with an antigen. They 
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recovered spleen cells from the mouse and used PCR 
to ampHfy millions of cDNAs for antibody light and 
heavy chains. The hght- and heavy-chain cDNAs were 
cloned separately into phage vectors and then recom- 
bined in vitro to generate a third, combinatorial MhTzry 
of phage carrying random pairs of light and heavy 
chains. The library was plated onto a bacterial lawn, 
and the resulting phage plaques, each containing a 
unique antibody, were screened with radioactively la- 
beled antigen in a manner similar to that used for 
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FIGURE 23-7 

Creating a combinatorial library of antibodies expressed in 
. £. coii Lymphocytes or spleen cells are removed from an 
immunized animal. mRNA is obtained and cDNA is synthe- 
sized with reverse transcriptase. The heavy- (H) and light- 
(L) chain genes are separately amplified by PGR, as shown 
in Figure 23-6, and ligated into 2. cloning vectors. Two dif- 
ferent libraries are produced, one containing the H chain 
genes and one containing the L chain genes (this step has 
been omitted from the figure for simplicity). Phage DNA is 
isolated from each library, and the H and L chain sequences 
are ligated together and packaged to form a combinatorial 
library. Each phage now contains a random pair of H and 
L chain cDNAs and thus upon infection of K coli directs 
the expression of the two antibody chains in infected cells. 
Since the H chain sequence contains only the variable 
region and the first constant domain, the antibody that 
forms is called a Fab, for antigen binding fragment It binds 
the antigen much like an intact antibody but it lacks the 
effector domain. To identify an antibody that recognizes the 
antigen, the phage library is plated, and the antibody (Fab) 
molecules present in the plaques are transferred to filters. 
The filters are incubated with radio actively labeled antigen 
and then washed to remove excess unbound ligand. A 
radioactive spot on the autoradiogram identifies a plaque 
that contains an antibody that binds the antigen. A recent 
procedure uses the phage display technology, described in 
Figure 23-10, to select antibodies with desired properties. 
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cloning cDNAs from an expression library (Figure 
7-10). Out of a million phage plaques screened, 200 
clonp were identified that produced an antibody bind- 
ing the antigen. Thus, with this approach, investigators 
were able to sample a million possible antibodies — 
at least a thousand times more than they could screen 
by., conventional monoclonal antibody technology. 
Since phages in a particular plaque encode the anti- 
body expressed in the plaque, it is a trivial matter to 
clone the heavy- and light-chain cDNAs from the 
phage DNA. These cDNAs can be placed into bac- 
terial or mammalian expression vectors foi: production 
of large quantities of the selected antibody. 

A recent modification of this method uses filamen- 
tous phages such as Ml 3 instead of A phage and allows 
display of the antibodies on the phage surface. This 
offers the advantage of being able to screen thousands 
more phage (because the screening can.be done, in 
solution) and to select phage that express tight-binding 
antibodies. We will discuss this method later and in 
Figure 23-10. 



"Humanized" Monoclonal Antibodies 
Retain Activity But Lose 
Immnnogenicity 

Although swift progress is being made in the identi- 
fication of monoclonal antibodies with potential ther- 
apeutic value, their use is limited by a problem we 
have already discussed in this chapter. Monoclonal 
antibodies are usually mouse proteins, and they are 
not identical to human antibodies. Thus, antibodies 
injected into a patient will eventually be recognized 
as foreign proteins and will be cleared from the 
circulation. 

As we learned in Chapter^ 16, both chains of the 
antibody molecule can be divided into variable and 
constant regions. The variable regions differ in se- 
quence from one antibody to another, and this is the 
region of the protein that binds the antigen. The con- 
stant region is the same among all antibodies of the 
same type. The first method used to reduce the im- 
munogenicity of a mouse monoclonal antibody was 
simply to construct chimeric genes that encoded pro- 
teins in which the variable regions from the mouse 
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Antibody engineering. The basic scrucmre of a mouse mono- 
clonal anubody (MAb) resembles that of a human antibody. 
Howrever, there are numerous differences between amino 
acid sequences of the antibodies from the two species. 
These sequence differences account for the immunogenicity 
of mouse MAbs in humans. A chimeric MAb is constructed 
by ligating the cDNA fragment encoding the mouse Vl and 
Vh domains to fragments encoding the C domains from z 
human antibody. Because the C domains do not contribute 
to anugen binding, the chimeric antibody will retain the 
same antigen specificity as the original mouse MAb but will 
be closer to hum^n antibodies in sequence. Chimeric MAbs 
still contains some mouse sequences, however, and may still 
be immunogenie A humanized UAh contains only those 
mouse amino acids necessary to recognize the antigen. 
This product is constructed by building into a human 
antibody the amino acids from the mouse complementarity 
determining regions or CDRs. 



antibody were fused to the constant regions from a 
human antibody. The chimeric antibody (Figure 
23-8) retained its binding specificity but more closely 
resembled a natural human andbody. 

This antibody, however, was not fully humanized, 
because it retained amino acid sequences from the 
mouse protein. Thus, scientists have, set out to engi- 
neer fully humanized monoclonal antibodies that will 
be indistinguishable from natural molecules. Extensive 
studies of the three-dimensional structures of antibody 
molecules tell us that only a few of the one hundred 
amino acids in the variable region of an antibody 
actually contact the antigen; these regions of contact 
are referred to as complementarity determining regions 
(CDRs). Three CDRs each comprise the antigen- 
binding sites on the light and heavy chains. The rest 
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of the variable region serves as a scaffold to anchor 
the CDRs in the correct positions. This breakdown 
of amino acids in the variable region into those serving 
recognition and those serving structural roles is also 
evident from simply corriparing the sequences of many 
antibody molecules. Amino acid sequences in the 
CDRs are hypervaHable, whereas the structural, or 
framework, amino acids differ little. 

Thus, to make a fully humanized antibody, all that 
would be required in principle would be to use in 
vitro mutagenesis to transfer the CDR amino acid 
sequences from a mouse MAb to a natural human 
antibody (Figure 23-8). This method was used to hu- 
manize an antibody that recognizes an antigen on the 
surface of human lymphocytes. This humanized MAb 
is now in clinical trials as an immunosuppressant and 
for treatment of lymphoid tumors. Another potentially 
valuable MAb binds a growth factor receptor found 
in large numbers on the surface of many breast tumor 
cells. Laboratory experiments showed that this anti- 
body could block the growth of these cells in culture 
and caused tumors seeded in mice to regress. Unfor- 
tunately, the first humanized versions of this antibody 
bound the receptor protein but failed to block the 
growth of breast carcinoma cells. Investigators sus- 
pected that the problem was with the framework amino 
acids, and they useid computer modeling to design 
amino acid substitutions that would strengthen the 
antibody- antigen interaction. Several such variant an- 
tibodies were produced and tested; one bound the 
receptor 250 times more tightly than did the original 
antibody and successfully blocked tumor cell growth 
in culture. This antibody is now being produced in 
large quantities for clinical trials. 



Protein Engineering Can Tailor 
Antibodies for Specific Applications 

Humanizing monoclonal antibodies is an example of 
the emerging technology oi protein engineerings that is, 
a process using recombinant DNA to modify the struc- 
ture of natural proteins to improve or change their 
function. Antibodies are particularly attractive can- 
didates for protein engineering, because their structure 



is understood in great detail and because their poten- 
tial for use in medicine is enormous. Another way in 
which antibodies are being engineered is by changing 
their effector domains, the regions of the heavy chain 
that specify antibody function — for example, killing 
of cells marked by the antibody. In this way, the mode 
of action of a monoclonal antibody can be repro- 
grammed. One promising strategy is to replace the 
effector domain entirely with a sequence encoding' a 
toxin. An antibody- toxin fusion protein would deliver 
the toxin specifically to cells bearing the target antigen. 
This product could be an exceptionally potent treat- 
ment for cancer and for viral diseases such as AIDS. 
Antibody engineering is also being used to construct 
hispecific antibodies. In these antibodies, each of the two 
arms recognizes a different antigen, thus allowing an 
antibody to bridge the two antigens. For example, a 
hispecific antibody could recognize a tumor cell pro- 
tein with one arm and a protein on the surface of a 
killer T cell with the other, thereby bringing the killer 
cells directly to the tumor (Figure 23-9). 



Protein Engineering Is Used to 
Improve a Detergent Enzyme 

Subtilisin is a serine protease produced by bacteria. 
Due to its broad specificity for proteins that commonly 
soil clothing, this enzyme was developed for com- 
mercial use in laundry detergents. (It is subtilisin that 
is prominently advertised as the enzyme additive in 
modern detergents.) But the first detergents containing 
subtilisin suffered from a serious drawback: they could 
not be used with bleach, because bleach inactivates 
the enzyme. Biochemical analysis determined that loss 
of activity was due to the oxidation of a methionine 
at position 222. Once this happened, the modified 
enzyme lost 90 percent of its activity. Because they 
knew which amino acid was bleach sensitive, however, 
scientists decided to see whether a variant of subtilisin 
could be produced that was no longer sensitive to 
bleach. . 

To do this, site-directed mutants were constructed 
in the gene encoding subtilisin. The strategy was sim- 
ply to substitute, one at a dme, each of the non— wild- 
type amino acids at residue 222. The mutant genes 
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even more active than the wild-rype protein, but it 
was also inactivated by bleach. The next most active 
variant was the alanine-substituted enzyme, which was 
53 percent as active as wild-type subtilisin. This vari- 
ant exhibited no detectable bleach sensitivity, so de- 
tergents containing this engineered subtilisin can now 
be used with bleach. This new variant of subtilisin is 
an example of ^, second-generation molecule, a molecule 
specifically engineered for a new desirable trait. Pro- 
tein engiheers are currently at work on a third-gen- 
eration molecule that exhibits decreased temperature 
sensitivity so that it can be used in hot water. 

This experiment points out the power of recom- 
binant DNA as a tool for the engineering of namr al 
^^^^^^^^•^^^"^^^^Z^^^ of a pfoteiii wasTU 
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FIGURE 23-9 

A bispecific andbody. By using recombinant DNA, the 
cDNAs for antibodies to two difFerent antigens caA be engi- 
neered to make an antibody in' which each arm recognizes a 
.different anngen. Thus ir is possible ro recombine andbodies 
to surface anugen on rumor cells and to a protein on cyto- 
roxiG T cells to make a bispecific andbody that brings the 
two cells together ro facilitate killing of the tumor cells 



were cloned into expression vectors and the 19 dif- 
ferent subtilfsin derivatives were expressed. Biochem- 
ical analysis showed that the cysteine-222 enzyme was 



Growth Hormone Variants 
with Improved Binding Are 
Selected by Phage Display 

To engineer an improved subtilisin enzyme, research- 
ers were aided by die knowledge that only one s^^cihc 
amino acid had to bfe changed. Thus, they could sys- 
tematically vary diat amino acid to find the one that 
worked the besL But more complex challenges face 
protein engineers. Is it possible, for example, to en-- 
gineer' antibodies with higher affinity for antigen; to 
design an inhibitor that tightly binds to and blocks a 
cell-surface protein or an enzyme inside a ceil; to 
generate a growth factor or hormone with increased 
affinity for its receptor? Alterations of this sort require 
several amino acid changes, and with 20 possible amino 
acids at each position, the number of variants that 
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need to be screened is enormous (for changes at just 
3 amino acids, there are 8000 different combinations; 
for 10 amino acids, 10*^ different proteins are possible). 
Clearly, these. variants cannot be made and tested one 
at a time, and a method for direct selection of improved 
proteins is needed. 

Researchers have used a new approach to select 
variants of human growth hormone with increased 
affinity for growth hormone receptor (Figure 23-10). 



FIGURE 23-10 

Expression of proteins and pepddes on the surface of 
filamentous phage. A library of randomly mutated hGH 
cDNAs was ligated into an Ml3-based phagemid vector so 
that hGH was fiased to the carboxy-terminal domain of the 
Ml 3 gene III protein. The carboxy terminus of the gene III 
protein associates with the phage particle, and the amino 
terminus, containing the HGH variants, is displayed on the 
outer surface of the phage. The library of phagemids is 
introduced into E. coli, and ampicillin-resistant colonies are 
obtained. These £ coli are then infected with a helper phage 
that induces the production of phagemid particles. Only 1 — 
10 percent of the phage particles contain an HGH-gene III 
fusion protein, and these contain only one hGH fusion 
molecule per phage. This ensures that the phage retain 
sufficient wild-type gene III protein in their coats to 
remain infectious. hGH-phage were passed through a 
column containing the hGH receptor covalently linked to 
plastic beads. Only the phage expressing hGH were re- 
tained. The nonbinding phage lacking hGH passed through 
the column. The bound phage were isolated, cultured in 
E coli, and passed, again over the column. Repeated rounds 
of selection resulted in the identification of hGH variants 
that bound the receptor with exceptionally high affinity. 



From structural smdies and extensive mutagenesis of 
hGH, they knew what portions of the amino acid 
sequence were important for receptor binding- They 
synthesized degenerate oligonucleotides that encoded 
all possible amino acids at these positions and ligated 
the pool of oligonucleotides in place of the natural 
hGH sequence. The resulting pool of variant hGH 
cDNAs was fused to the reading frame of gene III in 
the filamentous phage Ml 3. Gene III encodes a minor, 
phage coat protein expressed on the surface of the 
phage, and incorporation of the hGH cDNA into this 
gene results in the display of the hGH variants on the 
surface of the phage, one variant per phage. This tech- 
nique is known as phage display. 

Now it was a simple matter to pass this library of 
more than 10" different phage over a column con- 
taining the hGH receptor. Phage displaying weakly 
binding hGH variants were washed off the column, 
and phage displaying tightly binding variants were 
recovered with a more stringent wash This population 
of tight-binding phage was amplified by infection of 
£ coli and passed over the column a second time. The 
selection was repeated for a total of six rounds, each 
round enriching for the phage displaying hGH variants 
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With highest affinity for the receptor bound to the 
column. At this poini:, individual phage were cloned, 
the affinities of their hGH variants were measured 
directly, and che sequences of the hGH cDNAs were 
examined. Among these variants was one that bound 
its receptor about 10 times more tighdy than natural 
hGH did. When selected amino acids from another ' 
region of hGH that had been randomized were in- 
troduced into this variant, the resulting hGH molecule 
bound to the hGH receptor over 50 times more tightly 
than the wild-type hGH did. This process is being 
repeated in the hope to obtain even more tighdy bind- 
ing variants. 

The abihty afforded by techniques such as phage 
display to correlate protein structure and function in 
a systematic way makes possible new methods of find- 
ing novel drugs- If researchers have a good idea what 
combination of amino acids gives the best fit to the 
binding site on a receptor, the next step in rational 
drug design would be to design, or even select, a small 
peptide that binds as well as the larger protein. And 
then, using computer modeling to display the molec- 
ular contacts between ligand and receptor, researchers 
can attempt to design and synthesize small nonprotein 
molecules that make the same contacts. The end- 
product would be a small organic molecule that could 
be produced more cheaply than a recombinant protein, 
yet would retain the full biological activity of the 
protein hormone. And, more important, such mole- 
cules could be administered orally, thus eliminating . 
the major disadvantage of most recombinant protein 
therapeutics — that they must be delivered directly 
into the bloodstream by injection. This type of rational 
drug design contrasts sharply with the conventional 
approach to drug discovery now in use in the phar- 
maceutical industry, -in which an inventory of com- 
pletely unrelated compounds is tested at random until 
ah active compound is fouiad. 



New Technologies Promise New 
Approaches to Drag Design 

The biotechnology industry is in its infancy, and its 
SMcctssQS to date follow direcdy from developments 
in molecular biology that are already nearly two de- 
cades old. The recombinant drugs currendy in clinical 
use arise from what is by now conventional technol- 
ogy — gene cloning, expression, and mutagenesis to 
improve protein function. These methods will con- 
tinue to turn out new drugs such as erythropoietins 
to treat anemia caused by kidney disease, DNase to 
treat cystic fibrosis, or colony-stimulating factors 
(CSFs) to increase white blood cell production during 
chemotherapy. 

But the true promise of biotechnology is in novel 
technologies that are only now being developed. We 
have mentioned efforts to design catalytic antibodies 
that can accelerate chemical reactions in both medical 
and industrial applications. This is but one example 
of a whole new approach to protein engineering in 
which novel activities can be placed on unrelated pro- 
tein scaffolds, using random mutagenesis coupled with 
selection methods like phage display. Similar goals 
may be achieved by the engineering of r/^zjyTw^j-, RNA 
molecules with catalytic activity, and the use of the 
polymerase chain reaction to select nucleic acid mol- 
ecules that bind tightly to targets of medical impor- 
tance. ' Another strategy that may see widespread 
application is treatment with antisense DNA and RNA 
to inhibit the expression of oncogenes in tumors or of 
viral genes in infected patients. And a variety of new 
technologies based on viral vectors promise new ap- 
proaches for vaccines and gene therapy. 

Many of these techniques now work in the test 
tube, and the principal challenge facing biotechnology 
companies is to turn these laboratory .techniques into 
commercially viable processes. 
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